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(Constrained) Quantization Without Tears 

R. Jackiw 



ABSTRACT 

An alternative to Dirac's constrained quantization procedure is explained. 

To accomplish conventional and elementary quantization of a dynamical system, one is instructed to: 
begin with a Lagrangian, eliminate velocities in favor of momenta by a Legendre transform that determines 
the Hamiltonian, postulate canonical brackets among coordinates and momenta and finally define dynamics 
by commutation with the Hamiltonian. But this procedure may fail for several reasons: it may not be 
possible to solve for the velocities in terms of the momenta, or it may be that the Hamiltonian equations 
do not reproduce the desired dynamical equations. In such cases one is dealing with so-called "singular" 
Lagrangians and "constrained" dynamics. Almost half a century ago Dirac developed his method for handling 
this situation 1 and since that time the subject has defined an area of specialization in mathematical physics, 
as is put into evidence by a recent monograph 2 and by this series of workshops 3 . 

While Dirac's approach and subsequent developments can cope with most models of interest, my col- 
league Ludwig Faddeev and I realized that in many instances Dirac's method is unnecessarily cumbersome 
and can be streamlined and simplified. We have advertised 4 an alternative approach, based on Darboux's 
theorem, wherein one arrives at the desired results — formulas for brackets and for the Hamiltonian - 
without following Dirac step by step. 

Very specifically, two aspects of the Dirac procedure are avoided. First, when it happens that the 
Lagrangian L depends linearly on the velocity £ 4 for one of the dynamical variables £*, or even is independent 
of £*, the attempt to define the canonical momentum II, = and to eliminate & in favor of obviously 
fails. In the Dirac procedure, one nevertheless defines a canonical momentum and views the ^-independent 
expression % as a constraint on n^. In our method, such constraints are never introduced. Second, in 
the Dirac procedure constraints are classified and distinguished as first class or second class, primary or 
secondary. This distinction is not made in our method; all constraints are held to the same standard. 
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It is therefore clear that our approach eliminates useless paperwork, and here I shall give a description 
with the hope that this audience of specialists will appreciate the economy of our proposal and will further 
adopt and disseminate it. 

I shall use notation appropriate to a mechanical system, with coordinates labeled by {i,j, . . .} tak- 
ing values in a set of integers up to N, and a summation convention for repeated indices. Field theoretic 
generalization is obvious: the discrete quantities {i,j, . . .} become continuous spatial variables. Time depen- 
dence of dynamical variables is not explicitly indicated since all quantities are defined at the same time, but 
time-differentiation is denoted by an over-dot. Although the language of quantum mechanics is used, with 
h scaled to unity, ("commutation," etc.) ordering issues are not addressed — so more properly speaking 
we are describing a classical Hamiltonian reduction of dynamics. Grassmann variables are not considered, 
since that complication is a straightforward generalization. Finally total time derivative contributions to 
Lagrangians are omitted whenever convenient. 

Our starting point is a first-order Lagrangian formulation for the dynamics of interest; i.e. we assume that 
the Lagrangian is at most linear in time derivatives. This is to be contrasted with the usual approach, where 
the starting point is a second-order Lagrangian, quadratic in time-derivatives, and a first-order Lagrangian is 
viewed as "singular" or "constrained." In fact, just because dynamics is described by first-order differential 
equations, it docs not mean that there are constraints, and this is a point we insist upon and we view the 
conventional position to be inappropriate. 

Indeed there are many familiar and elementary dynamical systems that are first-order, without there 
being any constraints: Lagrangians for the Schrodinger equation and the Dirac equation are first-order in 
time derivatives; in light-cone quantization, where x + = (t + x) is the evolution coordinate, dynamics is 
first-order in this "time;" the most compact description of chiral bosons in two space-time dimensions is first 
order in time 5 . It is clear that characterizing any of these systems as "singular" or "constrained" reflects 
awkward mathematics rather than physical fact. 
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Moreover, a conventional second order Lagrangian can be converted to first-order form by precisely the 
same Legendre transform used to pass from a Lagrangian to a Hamiltonian. The point is that the formula 

H=^4-L (1) 
oq 

dL 

(2) 

Oq 

may also be read in the opposite direction, 

L(p,q) =pq~ H(p,q) (3) 

and it is straightforward to verify that Euler-Lagrange equations for the first-order Lagrangian L(p, q) coin- 
cide with the Hamiltonian equations based on H(p,q). Thus given a conventional Hamiltonian description 
of dynamics, we can always construct a first-order Lagrangian whose configuration space coincides with the 
Hamiltonian phase space. 

We begin therefore with a general first-order Lagrangian. 

L = a t (OC - V(0 (4) 

Note that at has the character of a vector potential (connection) for an Abelian gauge theory, in that 
modifying by a total derivative a* — > di + -J^6 does not affect dynamics, since the Lagrangian changes 
by a total time-derivative. Observe further that when a Hamiltonian is defined by the usual Legendre 
transform, velocities are absent from the combination j^r€ — L, since L is first order in £ l , and V may be 
identified with the Hamiltonian. 

dL 

H=^?-L = V (5) 
Thus the Lagrangian in (4) may be presented as 

L = ai (0e - H(0 (6) 

and the first term on the right side defines the "canonical one-form" d£_ l = a(£). 

To introduce our method in its simplest realization, we begin with a special case, which in fact will be 
shown to be quite representative: instead of dealing with a general we take it to be linear in £ 4 . 

= (7) 
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The constant matrix Wjj is anti-symmetric, since any symmetric part merely contributes an irrelevant total 
time-derivative to L and can be dropped. The Euler-Lagrange equation that follows from (6) and (7) is 

<*n? = -^m) (8) 

The development now goes to two cases. The first case holds when the anti-symmetric matrix 
possesses an inverse, denoted by w 4 - 7 , in which case uiij must be even-dimensional, i.e. the range N of 
{i, j, . . .} is 2n = N. It follows from (7) that £ l satisfies the evolution equation 

?=J:iJLlI(Z) (9) 

and there are no constraints. Constraints are present only in the second case, when has no inverse, and 
as a consequence possesses N' zero modes z\ , a = 1, . . . , N'. The system is then constrained by N' equations 
in which no time-derivatives appear. 

*-^ff(0 = o (io) 

On the space orthogonal to that spanned by the {z a }, u>ij possesses an even-dimensional (= 2n) inverse, so 
in this case N = 2n + N'. 

For the moment we shall assume that Wy does possess an inverse and that there are no constraints. The 
second, constrained case will be dealt with later. 

With the linear form for cii(£) and in the absence of constraints all dynamical equations are contained 
in (9). Brackets are defined so as to reproduce (9) by commutation with the Hamiltonian. 

This implies that we should take 

[e,e']=^ (lla) 

or for general functions of £ 
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It is reassuring to verify that a conventional dynamical model, when presented in the form (3), is a special 
case of the present theory with comprising the two-component quantity (^) and w%j the anti-symmetric 
2x2 matrix ey, en = 1- Eq. (lib) then implies \q, p\ = i. 

Next let us turn to the more general case with aj(£) an arbitrary function of not depending explicitly 
on time. The Euler-Lagrange equation for (6) is 



d_ 

di 1 



fij(Z)£ j = (12) 



where 



Qfi a i® ~ 8$*® (13) 

fij behaves as a gauge invariant (Abclian) field strength (curvature) constructed from the gauge-variant 
potential (connection). It is called the "symplectic two-form," | fij(£) d£ l d^ = /(C); evidently it is exact: 
/ = da, and therefore closed: df = 0. In the non-singular, unconstrained situation the anti-symmetric NxN 
matrix fij has the matrix inverse / y , hence N = 2n, and (12) implies 

C = f ij -^jH(0 (14) 

This evolution equation follows upon commutation with H provided the basic bracket is taken as 

£\ e =if j (0 (15) 

The Bianchi identity satisfied by fij ensures that (15) obeys the Jacobi identity. 

The result (15) and its special case (lib) can also be derived by an alternative, physically motivated 
argument. Consider a massive particle, in any number of dimensions, moving in an external electromagnetic 
field, described by the vector potential dj(£) and scalar potential V(£). The Lagrangian and Hamiltonian 
are expressions familiar from the theory of the Lorentz force, 

L=\mee + ai(t)e-V(Q (16a) 
H=±-{ Pl -a l (i)) 2 + V(0 (16b) 
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with pi conjugate to £\ It is seen that (4), (5) and (6) correspond to the m — > limit of (16a) and (16b). 
Owing to the 0(m _1 ) kinetic term in (16b), the limit of vanishing mass can only be taken if pi — cij(£) = m£* 
is constrained to vanish. Adopting for the moment the Dirac procedure, we recognize that vanishing of m£ l 
is a second class constraint, since the constraints do not commute, 

mf , mi 1 = \pi - Oi(£), Pj - aj(0] 

= ifij(O^0 (17) 

and computing the Dirac bracket [C,^] regains (15). 

In this way we see that what one would find by following Dirac is also gotten by our method, but we 
arrive at the goal much more quickly. Also the above discussion gives a physical setting for Lagrangians of 
the form (6) : when dealing with a charged particle in an external magnetic field, in the strong field limit the 
Lorentz force term — the canonical one-form — dominates the kinetic term, which therefore may be dropped 
in first approximation. One is then left with quantum mechanical motion where the spatial coordinates fail 
to commute by terms of order of the inverse of the magnetic field. More specifically, with constant magnetic 
field B along the z-axis, energy levels of motion confined to the x-y plane form the well-known Landau bands. 
For strong fields, only the lowest band is relevant, and further effects of the additional potential V(x,y) are 
approximately described by the "Peierls Substitution" 6 . This states that the low- lying energy eigenvalues 
are 

E=^ + e n (18) 

where ^ is the energy of the lowest Landau level in the absence of V, while the e„ are eigenvalues of the 
operator V(x,y) (properly ordered!) with 

y] = ^ (19) 

Clearly the present considerations about quantizing first-order Lagrangians give a new derivation 7 of this 
ancient result from condensed matter physics. 6 [One may also verify (18) by forming mH from (16b) and 
computing e„ perturbatively in m. 8 ] 
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While the development starting with arbitrary a,(£) and unconstrained dynamics appears more general 
than that based on the linear, special case (7), the latter in fact includes the former. This is because by 
using Darboux's theorem one can show that an arbitrary vector potential [one-form <n d^ 1 } whose associated 
field strength [two-form d(cn d£ l ) = \ d£ l is non-singular, in the sense that the matrix possesses 
an inverse, can be mapped by a coordinate transformation onto (7) with non-singular. Thus apart from 
a gauge term, one can always present as 



Oi{0 = lQ k {0ukt?^p- (20a) 



correspondingly fij(£) as 



fii - Q£i W « Qg ( 20b ^ 

and in terms of new coordinates Q l the curvature is Uij — a constant and non-singular matrix. Moreover, 
by a straightforward modification of the Gram-Schmidt argument a basis can be constructed such that the 
antisymmetric N x N matrix uiij takes the block-off-diagonal form 

^={-i 0- (2i) 

where I is the n-dimensional unit matrix (N=2n). [With these procedures one can also handle the case when 
Oj is explicitly time-dependent — a transformation to constant can still be made.] In the Appendix we 
present Darboux's theorem adopted for the present application, and we explicitly construct the coordinate 
transformation Q l (£). The coordinates in which the curvature two- form becomes (21) are of course the 
canonical coordinates and they can be renamed pt, q l , i = 1, . . . , n. 

We conclude the discussion of non-singular, first-order dynamics by recording the functional integral for 
the quantum theory. The action of (4) obviously is 

1 = J OiiOdC-J H(Odt (22) 

and the path integral involves, as usual, the phase exponential of the action. The measure however is 
non-minimal; the correct prescription is 



J UiVC det 5 



2 f jk cxpil . (23) 
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The det 5 factor can be derived in a variety of ways: One may use Darboux's theorem to map the problem 
onto one with constant canonical curvature (21), where the measure is just the Liouville measure YliZi ®C = 
YYl =l r Dp i Vq % , and the Jacobian of the transformation is seen from (20b) to be det 5 fy. Alternatively one 
may refer to our derivation based on Dirac's second class constraints, eqs. (16), (17), and recall that the 
functional integral in the presence of second class constraints involves the square root of the constraints' 
bracket 9 . By either argument, one arrives at (23), which also exhibits the essential nature of the requirement 
that fij be a non-singular matrix. 

We now turn to the second, more complicated case, where there are constraints because fij is singular. 
It is evident from the Appendix that the Darboux construction may still be carried out for the non-singular 
projection of fij, which is devoid of the zero-modes (10). This results in the Lagrangian 

L=±?w ij i i -H(t,z) (24) 

Here ujij may still be taken in the canonical form (21), but now in the Hamiltonian there are N' additional 
coordinates, denoted by z a , a = 1, . . . , N', arising from the N' zero modes of and leading to N' constraint 
equations. 

£-H(£,z) = (25) 

This is the form that (10) takes in the canonical coordinates achieved by Darboux's theorem. The constrained 
nature of the z a variables is evident: they do not occur in the canonical one-form \ £ 8 dt and there is 
no time-development for them. 

In the next step, we examine the constraint equations (25) and recognize that for the z a occurring non- 
linearly in H(£, z) one can solve (25) for the z a . [More precisely, this needs det 9 d f%^ ^ 0.] On the other 
hand, when H(£, z) contains a constrained z a variable linearly, Eq. (25) does not permit an evaluation of 
the corresponding z a , because (25) in that case is a relation among the £\ with z a absent from the equation. 
Therefore using (25), we evaluate as many z a 's as possible, in terms of £"s and other z a 's, and leave for 
further analysis the linearly occurring z a 's. Note that this step does not affect the canonical one-form in the 
Lagrangian. 
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Upon evaluation and elimination of as many z a 's as possible, we are left with a Lagrangian in the form 



where the last term arises from the remaining, linearly occurring z a 's, now renamed and the only true 
constraints in the model are the <E> fe , which enter multiplied by Lagrange multipliers A*,. To incorporate the 
constraints, it is not necessary to classify them into first class or second class. Rather we solve them, by 
satisfying the equations 



which evidently give relations among the — evaluating some in terms of others. This procedure obviously 
eliminates the last term in (26) and it reduces the number of £"s below the 2n that are present in (26); also 
it replaces the diagonal canonical one- form by the expression a"j(£) d£,\ where i ranges over the reduced set, 
and di is a non- linear function obtained by inserting the solutions to (27) into (26). 

The Darboux procedure must now be repeated: the new canonical one-form a"i(£) d£ l , which could be 
singular, is brought again to diagonal form, possibly leading to constraint equations, which must be solved. 
Eventually one hopes that the iterations terminate and one is left with a completely reduced, unconstrained 
and canonical system. 

Of course there may be the technical obstacles to carrying out the above steps: solving the constraints 
may prove too difficult, constructing the Darboux transformation to canonical coordinates may not be possi- 
ble. One can then revert to the Dirac method, with its first and second class constraints, and corresponding 
modifications of brackets, subsidiary conditions on states, and non-minimal measure factors in functional 
integrals. 

I conclude my presentation by exhibiting our method in action for electromagnetism coupled to mat- 
ter, which for simplicity I take to be Dirac fields ip, since their Lagrangian is already first order. Also I 
include a gauge non-invariant mass term for the photon, to illustrate various examples of constraints. The 
electromagnetic Lagrangian in first-order form reads 



(26) 



$ (0 = 



(27) 
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- H M ((V - iA)V) 

dr^A (p-V--E)-Y A o} ( 28 ) 

Here A is the vector potential with B = V x A, A n the scalar potential that is absent from the symplectic 
term, p the photon mass. The matter Hamiltonian is not specified beyond an indication that coupling to A 
is through the covariant derivative, while p = Tp*ip. The Lagrangian is in the form (24); when p is non-zero 
the constrained variable A enters quadratically and 

SH 



SA (r) 

leads to the evaluation of A 



= (29) 



Ao = \ (p - V • E) (30) 
A 4 

so that the unconstrained Lagrangian becomes 

L = /drj-E- A + i^>- ^E 2 + B 2 + ,i 2 A 2 + -1 (p - V • E) 2 ^) j - H M ((V - iA)r/>) (31) 

The canonical pairs are identified as (— E, A) and (iip* , tp). In the absence of a photon mass, the Lagrangian 
(28) is of the form (26), with one Lagrange multiplier A = A . Eq. (29) then leads to the Gauss law 
constraint. 

V • E = p (32) 
To solve the constraint, we decompose both E and A into transverse and longitudinal parts, 

E = E T + ^=E (33) 
A = A T + ^=A (34) 
V • E T = V • A T = 

and (32) implies E = p. Inserting this into (28) at p 2 = 0, we are left with 

L = J dr |-E T • A T + P^=^A + i^V - \ (e | + B 2 - p^p 

- H M ( (V - iA T - i A^j (35a) 
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While the constraint has been eliminated, the canonical one-form in (35a) is not diagonal. The Darboux 
transformation that is now performed replaces ip by (cxpi^=^ Aj ip. This has the effect of canceling 
p— == A against a contribution coming from iip*ip and eliminating A from the Hamiltonian (since B = 
V x At)- We are thus left with the Coulomb-gauge Lagrangian 

L = J dr | -E T • A T + iip*ip - ^E| + B 2 - p^ p) j - H M ((V - iA T ) ip) (35b) 

without ever selecting the Coulomb gauge! The canonical pairs are (— Et, At) and (iip*,ip). 

We recall that the Dirac approach would introduce a canonical momentum n conjugate to A n and 
constrained to vanish. The constraints (30) or (32) would then emerge as secondary constraints, which must 
hold so that [H, n ] vanish. Finally a distinction would be made between the p ^ and p — theories: 
in the former the constraint is second class, in the latter it is first class. 9 None of these considerations are 
necessary for successful quantization. 

Our method also quantizes very efficiently Chern-Simons theories, with or without a conventional kinetic 
term for the gauge field 10 [indeed the phase space reductive limit of taking the kinetic term to zero, as in (16), 
(17) above, can be clearly described 11 ] as well as gravity theories in first order form, be they the Einstein 
model 12 or the recently discussed gravitational gauge theories in lower dimensions 13 . 

Finally, we record a first order Lagrangian L for Maxwell theory with external, conserved sources 
{Pi j)) P + V • j = 0, which depends only on field strengths (E, B) (rather than potentials) and is self-dual 
in the absence of sources. 

L = J drdr' (j5*(r) + /(r)) (r - r') B j (r') 

- X - J dr (E 2 + B 2 ) - j dr (x^p - V • E) + A 2 V • b) (36) 

^(r) Se «*^f(r) = i;e«*£ (37) 

Varying the E and B fields as well as the two Lagrange multipliers Ai^ gives the eight Maxwell equations. 
The duality transformation E — > B, B — > — E, supplemented by Ai — > — A2, A2 — ► Ai changes the Lagrangian 
by a total time derivative, when there are no sources. The canonical one-form is spatially non-local, owing 
to the presence of , which has the inverse 

u ij (r) - -e^ k d k S(r) (38) 
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when restricted to transverse fields — these are the only unconstrained degrees of freedom in (36). It then 
follows that the non-vanishing commutator is the familiar formula. 



£^(r), B^(r') = -i e ijk d k 5{r - r') 



(39) 



This self-dual presentation of electrodynamics is similar to formulations of self-dual fields on a line 5 and on 
a plane 10 . 



Appendix 

Darboux's Theorem 

We give a constructive derivation of Darboux's Theorem. Specifically we show that subject to regularity 
requirements stated below, any vector potential (connection one-form) aj(£) may be presented, apart from 
a gauge transformation, as 

ai(0 = ^Q m (0" m „^P (A.l) 
and correspondingly the field strength fa (£) (curvature two- form) as 

(AJ) 

with u) mn constant and anti-symmetric. The proof also gives a procedure for finding Q m (£). It is then 
evident that a coordinate transformation from £ to Q renders constant and a further adjustment of the 
basis puts w, 3 in the canonical form (21). 

We consider a continuously evolving transformation Q m (£; r), to be specified later, with the property 
that at t = 0, it is the identity transformation 

Q m ( £ Q) = ?m (A 3a) 

and at r = 1, it arrives at the desired <5 m (£), (which will be explicitly constructed). 

Q ro (£;l) = Q ro (0 (A.3b) 
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Q m (£; t) is generated by v m (£; r), in the sense that 

^Q m ^r)=v m (Q(^T);T) (A.4) 

Note, that v m depends explicitly on r. Also we need to define the transform by Q m (£; r) of quantities 
relevant to the argument: connection one-form, curvature two-form etc. The definition is standard: the 
transform, denoted by Tq, acts by 

dO m 

Tq Oi(0 = a m (Q) (A.5a) 
T Q fim = f m n(Q)-^--^- (A.5b) 

To give the construction, we consider the given a,(£) to be embedded in a one-parameter family a,(£; r), 
such that at r = we have ai(£) and at r = 1 we have | £ m w m i, where w m i is constant and anti-symmetric. 

ai(£;0)=aj(O (A.6a) 

ai(^l) = ^ m w™ (A.6b) 

It is then true that 

|- (T Q a,(e; r)) = T Q (l„ r) + A a^; r)) (A.7) 

where is the Lie derivative, with respect to the vector v m that generates the transformation, see (A.4). 
Eq. (A.7) is straightforwardly verified by differentiating with respect to r, and recalling that both the 
transformation and a, are T-dependent. Next we use the identity 14 

L v Oi =v n f ni + di(v n a n ) (A.8) 

and observe that when the generator is set equal to 

v n ^r) = -r(^T)^-a l (^r) (A.9) 



Eq. (A.7) leaves 



±(T Q a l ) = T Q {d l {v n a n )) (A.10) 
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Thus ^r(TQOi) is a gauge transformation, so that TQOi at r = 0, i.e. a,(£), differs from its value at r = 1, 
i.e. \Q m {£,)u mn 9C q^\ by a gauge transformation. This is the desired result, and moreover Q m (£; r) 
and Q m (£) = Q m (£; 1) are here explicitly constructed from the algebraic definition (A. 9) for v n [once an 
interpolating aj(£; r) is chosen], and integration of (A. 4) (the latter task need not be easy). 

Clearly (A. 9) requires that r) possesses the inverse r); hence both the starting and ending 

forms, fij{$,) and must be non-singular. Also fij(£; r) must remain non-singular for all intermediate r. 
In fact this is not a restrictive requirement, because one may always choose to be the value of fij(£) at 
some point £ = £ , and then by change of basis transform to any desired form. 

This description of Darboux's theorem was prepared with the assistance of B. Zwiebach, whom I thank. 
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